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ABSTRACT 


Different DNA markers have been utilized in the last few decades as important molecular tools in plants for genetic relation stud¬ 
ies among individuals, hybrid and varietal identification, phylogenetic relationship among species, gene mapping and tracking 
quantitative trait loci. These markers can be broadly classified into hybridization and PCR based markers. Restriction fragment 
length polymorphism (RFLP) represents the hybridization based marker; while PCR dependent includes more reliable and 
advanced polymorphic markers like amplified fragment length polymorphism (AFLP), inter-simple sequence repeats (ISSR), sin¬ 
gle nucleotide polymorphism (SNP), sequence related amplified polymorphism (SRAP), start codon targeted (SCoT) and inter¬ 
primer binding site (iPBS) among others. Functional markers (FMs) have also been developed from functionally characterised 
sequence motifs which are superior to random markers due to their complete linkage to trait locus alleles. With the advent of 
next generation sequencing (NGS) technology, excellent opportunities are offered for generating ample structural and functional 
genomic information in many important crops. The novel approach of genotyping-by-sequencing (GBS) employs NGS protocols 
for discovering and genotyping SNPs in many important crop genomes and populations. The present review focuses on the 
description of varied molecular markers, their methodologies, strengths and limitations as well as applications in plant breeding 
and genetic research. 
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INTRODUCTION 

Knowledge of genetic structure and level of variation with¬ 
in and between plant populations is important for effective 
utilization and conservation of plants. Factors determining 
level and structure of genetic variation within plant species 
include evolutionary history characteristics, population 
density, mating system and mechanism of gene flow [1]. 
Although morphological characters, cytological and eth¬ 
nological parameters have been used traditionally to char¬ 
acterize levels and patterns of diversity, these traits alone 
represent only a small portion of plant genome and are 
influenced by environmental factors [2]. These limit their 
utility in describing the potentially complex genetic struc¬ 
tures that may exist within and between taxa [3]. Advances 
in biochemistry and molecular biology during the past few 
decades have helped to overcome these constraints by pro¬ 
viding breeders with many powerful molecular markers 
[4]. Various molecular approaches for detection of genetic 
diversity have been devised in many plants including im¬ 
portant horticulture crops using DNA based markers. These 


markersare generally independent of environmental factors 
and are more numerous than phenotypic characters provid¬ 
ing clearinformation of underlying variation in the genome 
of an organism. The present review aims to describe differ¬ 
ent marker systems which are employed in plant identifi¬ 
cation, crop improvement, genome analysis, phylogenetic 
and population diversity studies. 

HYBRIDIZATION BASED DNA MARKER 

Restriction fragment length polymorphism (RFLP) is the 
only marker system representing hybridization based mark¬ 
er. This involves the use of restriction enzymes and hybridi¬ 
zation of the target fragment by labelled probe. It is based on 
the generation of different size DNA fragments due to diges¬ 
tion by restriction enzymes. Genomes of individuals belong¬ 
ing to same species will differ in DNA fragment production 
after restriction digestion as a result of point mutation, inser¬ 
tion/deletion, translocation, inversion and duplication. RFLP 
steps involve the cutting of genomic DNA by restriction 
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enzyme generating different sized DNA fragments. 6 base 
pair cutter enzymes are most often used for RFLP analysis 
as they are cheaper and readily available and alsogenerate 
product range (200 to 20,000 bp) that can be conveniently 
separated on agarose gels [5]. The separated DNA fragments 
are transferred to nitrocellulose membrane by Southern blot 
technique [6]. Fragments of interest are identified by hybrid¬ 
izing with complementary radioactive labelled probe and 
specific banding pattern is visualized after autoradiography 
The result obtained from RFLP technique depends on both 
restriction enzymes and number of probes. RFLP exhibits 
high reproducibility, codominant inheritance, easy data trans¬ 
ferability between laboratories and provides locus specific 
markers. Disadvantages of the technique are time consum¬ 
ing, requirement of high quality and quantity of DNA, ex¬ 
pensive radioactive probes, involvement of tedious Southern 
blotting method and necessity of prior sequence information 
for developing radio labeled probe. The RFLP markers have 
been employed for genetic diversity and population genetic 
study in varied plants like Quercus phellos [7], Saccharum 
spp.[8] and Vignci radiata [9]. 

PCR BASED MARKERS 

Polymerase Chain Reaction (PCR) technique was invented 
by Kary Mullis in 1983 [10]. It involves the use of Taq DNA 
polymerase obtained from Thermus aquaticus for exponen¬ 
tial amplification of very small amount of target DNA. The 
prior sequence information of flanking sites of the target 
DNA is required to design an appropriate primer for amplifi¬ 
cation of selected nucleotide segment. In addition to primers 
and DNA polymerase, supplementation of deoxynucleotide 
triphosphate (dNTPs) and magnesium ions along with ap¬ 
propriate buffer system are essential in basic PCR protocol. 
The PCR based markers are more advantageous over hy¬ 
bridization method due to less amount of DNA requirement, 
absence of radioisotopes, high reproducibility, more reliable 
and higher polymorphism in short time. 

Amplified fragment length polymorphism 
(AFLP) 

The technique employs both RFLP and PCR by ligating 
primer recognition sequences to the DNA fragments pro¬ 
duced through restriction digestion [11]. Simultaneous 
screening of representative DNA regions distributed ran¬ 
domly thr oughout genome is the important feature of AFLP. 
Polymorphisms of AFLP may be produced due to mutation 
at restriction site, insertions, duplications or deletions inside 
amplification fragments and mutations of the sequences 
flanking the restriction sites. Good quality DNA as well as 
partially degraded DNA can be used for AFLP analysis but 
DNA should be free from restriction enzymes and PCR in¬ 
hibitors. The genomic DNA might be digested by restriction 


enzymes which are the combination of rare cutter (Eco RI or 
Pstl) and frequent cutter (Msel or Taql). The double stranded 
oligonucleotide adaptors which do not bear initial restriction 
sites after ligation are developed and ligated to both ends 
of the fragments to give known sequence for PCR. PCR is 
first performed with primer combinations containing a sin¬ 
gle base pair extension while final amplification is carried 
out by using primer pairs with upto 3 base pair extension. 
AFLP fragments generated are visualized either on agarose 
gel or on denaturing polyacrylamide gel with autoradiogra¬ 
phy or AgN0 3 staining respectively. The method is highly 
reproducible and its ability to detennine high polymorphism 
in a single reaction makes AFLP one of the most sought after 
molecular tools for genetic analysis [12]. The limitations are 
the requirement of high molecular weight purified DNAs, 
possibility of co migrating non-homology fragments belong¬ 
ing to different loci. 

Inter simple sequence repeat (ISSR) 

This technique was first reported by Zietkiewicz at al. [13] 
and involved amplification of DNA segments present at an 
amplifiable distance between two identical microsatellite re¬ 
peat regions oriented in opposite directions. Microsatellites 
of di, tri, tetra or penta-nucleotide core sequences are used as 
primers to amplify mainly inter simple sequence repeats of 
different sizes. ISSR primers are longer (15-35 mers) which 
allow higher annealing temperature resulting in higher strin¬ 
gency. The annealing temperature however depends on the 
GC content of the primer. After PCR amplification the am¬ 
plified products of 200 to 2000bp long are separated through 
agarose or polyacrylamide gel electrophoresis and the result¬ 
ing ISSR banding pattern can be visualized through auto¬ 
radiography or AgN0 3 staining. Prior sequence information 
of template DNA is not required for generating ISSR poly¬ 
morphism. ISSR markers are simple, randomly distributed 
in the genome, exhibit mostly dominant inheritance pattern 
and require low quantity of DNA. They are employed in spe¬ 
cies and plant varietal identification, taxonomic and genetic 
diversity studies, gene mapping and clonal fidelity testing 
of in vitro derived plants[14]. The main limitations of ISSR 
marker are low reproducibility and homology of co-migrat- 
ing amplification products [15]. 

Single nucleotide polymorphism (SNP) 

It is the new generation molecular marker which detects pol¬ 
ymorphisms among individuals due to changes in single nu¬ 
cleotide position. The simplest approach to discovering SNP 
in targeted region containing genes is performing direct se¬ 
quencing of genomic PCR products derived from varied in¬ 
dividuals. This approach may be costly for large scale study 
as there is a need for locus-specific primers and is limited 
to regions for which data are available. The other method 
based on the comparison of sequences obtained from cloned 
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fragments can be considered for developing SNP map of the 
genome [16]. Many SNP genotyping methods are developed 
which combine two elements: generation of an allelic prod¬ 
uct and analysis thereof [16]. Sobrino et al. [17] assigned 
the majority of SNP genotyping assay to one of four groups 
based on molecular mechanism such as allele specific hy¬ 
bridization, primer extension, oligonucleotide ligation and 
invasive cleavage. Allele specific hybridization is based on 
distributing between two DNA targets differing at one nu¬ 
cleotide position by hybridization [18]. Two allele-specific 
probes are designed with polymorphic base in a central posi¬ 
tion in probe sequence. Under optimized assay conditions 
perfectly matched probe target is stable while one base mis¬ 
match is unstable. Most hybridization techniques are based 
on Dot Blot and Reverse Dot Blotmethods. Main advantages 
of the marker system are possibility of using small and ex¬ 
tremely degraded DNA sample, high genomic abundance, 
completely automated sample processing and generation 
of no stutter products. The SNPs are found to be associated 
with plant genes of economic values e.g., presence of SNP 
markers for waxy gene controlling amylose content in Rice, 
linking of SNP with dwarfing gene in Rice and male sterility 
in Onion [19]. 

Diversity arrays technology (DArT) 

This technique is based on microarray hybridization involv¬ 
ing simultaneously genotyping of several hundred poly¬ 
morphic loci spread over the genome [20]. Genomic repre¬ 
sentations are set up by genomic DNA restriction digestion 
followed by ligation of restriction fragments to adaptors for 
each of the individual DNA sample. Primers designed for 
adaptors and selective overhangs are used in PCR amplifica¬ 
tion in order to reduce genome complexity. DNA fragments 
derived from the representations are cloned and clone ampli¬ 
fication is performed using vector-specific primersfollowed 
by purification and finally arrayed onto a solid support (mi¬ 
croarray) forming a discovery array [15]. Labeled genomic 
representations formed from individual genomes included 
in the pool are hybridized to discovery array. Polymorphic 
clones exhibiting differential hybridization signal intensities 
are later set up into a “genotyping array” for routine genotyp- 
ing[20].The advantages of this marker are high reproducibil¬ 
ity, readily expandable, quick and non-requirement of prior 
sequence information. They are technically more demanding 
with long tedious steps and require skilled personnel being 
microarray-based technique. High investment in terms of 
time, physical energy, cost and advanced lab facilities limit 
the application of DArT markers in plant research. 

Conserved DNA derived polymorphism (CDDP) 

The technique was developed by Collard and Mackill [21] 
to fingerprint rice varieties. It was also used to evaluate the 
genetic diversity existed in Solanum dulcamara [22]. The 



short conserved sequences found in plant genome in multi¬ 
ple copies are targeted by primers designed to bind to these 
genes and generate polymorphic banding patterns. The prim¬ 
ers also target common plant genes which are related to abi¬ 
otic and biotic stress or responsible for plant development. 
Variation can be detected as length polymorphism within 
these regions since highly conserved DNA regions sharing 
the same priming site but differ in their genomic distribu¬ 
tion. Single long primer amplifications with a high annealing 
temperature improve the reproducibility in CDDR The DNA 
fragments produced are in the range of 200-1500 bp which 
are separated by electrophoresis and the banding patterns ob¬ 
served after autoradiography. 

Start codon targeted (SCoT) 

This is a novel marker system developed based on the short 
conserved regions flanking the ATG start codon in plant ge¬ 
nome. SCoT involves the use of 18-mer primers with an¬ 
nealing temperature at 50°C [21], Single primer is used in 
PCR which means the same primer is utilized as forward and 
reverses primer as in RAPD and ISSR markers. These mark¬ 
ers are generally reproducible and the length and annealing 
temperature are not the most important factors determin¬ 
ing reproducibility [23]. They are dominant markers which 
can be used for plant genetic analysis, quantitative trait loci 
(QTL) mapping and bulk segregate analysis [21], The PCR 
products generated after amplification reaction are subjected 
to general agarose gel electrophoresis to separate the frag¬ 
ments and the bands are detected through autoradiography. 

Sequence related amplified polymorphism 
(SRAP) 

This simple, inexpensive, dominant marker technique was 
developed by Li and Quiros [24]. They developed the marker 
to specifically amplify coding regions of genome of Brassica 
oleraceae by targeting GC rich exons and AT rich promot¬ 
ers, introns and spacers. Forward primers are designed to 
contain GC rich sequences near the 3 ’ end whereas reverse 
primers contain AT rich sequences at the 3 ’ end. “CCGG” 
sequences are present in the core of forward primers while 
the “AATT” sequences represent the core of reverse primers 
[25]. The PCR amplification products are resolved follow¬ 
ing agarose gel electrophoresis and the bands are visualized 
through autoradiography. The DNA fragments are scored by 
simple absence or presence of bands as done in ISSR and 
RAPD markers. SRAPs have been employed to evaluate ge¬ 
netic variation at species level, in population genetic analysis 
of closely related hybrids, construction of linkage maps and 
in identification of quantitative trait loci [26]. 

Inter-Primer binding site (iPBS) 

Use of retrotransposons as molecular marker has limitation 
due to requirement of sequence information of LTR to design 
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element specific primers. There is necessity for numerous 
cloning and sequencing steps to obtain a few good primer 
sequences. Most LTR regions do not have conserved motif 
for direct amplification by PCR [27]. Kalender et al. [28] 
developed iPBS marker for identifying diverse LTR se¬ 
quences and directly visualising the polymorphism among 
the plant cultivars. This method utilizes PBS (primer binding 
sites) which is a conserved sequence located adjacent to the 
5’ LTR and is universally present in all retrotransposons. The 
tRNA binds to PBS region to initiate reverse transcription 
by producing complementary base pairing between tenninal 
sequence of tRNA and conserved region of PBS [29]. Prim¬ 
ers can be designed which match the conserved region of 
PBS and DNA fragments of diverse LTR sequences can be 
generated after PCR amplification [27]. The cloning of LTR 
sequences was previously dependent on conserved protein 
coding domains limiting the screening of autonomous ele¬ 
ments [30]. But iPBS technology allows screening of diverse 
LTR sequences while performing effective DNA fingerprint¬ 
ing. The level of polymorphism generated by iPBS ampli¬ 
fication is as efficient as that of IRAP and RBIP markers. 
TheiPBS markers are highly effective and reliable for poly¬ 
morphism detection and determination of clonal differentia¬ 
tion arising out of varied retrotransposon activities and retro- 
transposon recombinations [28]. iPBS is also highly reliable 
and powerful DNA finger printing technology which does 
not require the knowledge of sequence information [31]. It 
is used in phylogenetic and genetic diversity studies of num¬ 
ber of plant species. Genetic diversity studies have also been 
successfully performed using iPBS markers in Apricot, Lens 
species and field pea [31, 32] 

FUNCTIONAL MARKERS (FMS) 

Morphological markers previously employed in plant breed¬ 
ing are easily affected by the environment apart from their 
limited number and difficulty in scoring [33]. The introduc¬ 
tion of random DNA markers RFLP, SSR,AFLP and SNPs 
have brought significant improvement in plant breeding 
programs for producing improved crop varieties. They are 
used for hybridity confirmation, parents and progenies iden¬ 
tification, evolutionary relationship study, genetic diversity 
detennination between and within populations and mapping 
genes for marker assisted selection (MAS). However, the use 
of such markers as diagnostic tool for MAS in plant breed¬ 
ing has limitations because of false positives produced by 
genetic recombinations[34]. Genetic recombination between 
the marker and target locus impairs the transfer of marker 
information from experimental mapping population to un¬ 
related breeding materials [35]. The random markers are 
derived at random from polymorphic sites in the genome 
and are developed independent of their relationship to any 
phenotypic characters. But functional markers (FMs) are de¬ 


veloped from polymorphic sites within the genes casually 
involved in producing phenotypic trait variation [33]. The 
absence of recombination for FM and its complete linkage 
to desired allele prevent information loss and false selection 
in MAB [36], This also makes FM more effective in iden¬ 
tification and selection of favourable alleles and enhances 
its diagnostic capability. Unlike random DNA markers, the 
phenotypic validation in MAB is not required while using 
FMs in plant breeding [33]. When random markers are used 
in MAB, there is possibility of transferring target gene along 
with unwanted genes located at a distance due to linkage 
drag producing undesirable phenotypic traits[37]. 

Functional marker development 

The development of functional markers requires function¬ 
ally characterised genes and identification of phenotypic/ 
functional sites affecting plant phenotypic traits. One of the 
challenges of functional marker development is to relate 
sequence polymorphic of functional motif with phenotypic 
trait variation [33]. Association studies identify genes and 
even functional motif within the genes that affect phenotypic 
characters [38], But this approach is dependent on linkage 
disequilibrium (LD) mapping which rely on non-random oc¬ 
currence of allele haplotype in the genome[39]. Low level 
of LD is essential for determining the effects of intragenic 
polymorphism on phenotypic changes. Association stud¬ 
ies can identify sequence motifs affecting phenotypic trait 
expression in crops having low LD. Application of associa¬ 
tion studies may not suffice the distinction of causative from 
phenotypically neutral polymorphism in haplotype structure 
[38]. Functional markers developed through the involvement 
of association studies are termed as indirect functional mark¬ 
ers (IFM) as only indirect (statistical) evidence of sequence 
motif function can be provided [33]. However, the direct 
proof of sequence motif function can be achieved after com¬ 
paring the isogenic genotypes which differ in single sequence 
motif. These isogenic genotypes can be generated through 
the approach of TILLING and homologous recombinations 
[33], The development of FMs for selected traits has been 
observed successfully in few crops. Iyer and McCouch [40] 
cloned gene xa-5 in rice which made it possible to develop 
functional markers for Xa5-mediated resistance to bacterial 
blight disease [41]. Several candidate genes for FM develop¬ 
ment which have potential for controlling agronomic traits 
have been cloned in different plant crops - Dwarf 8 in Maize 
[38] and OsARD2 in Rice [42], 

GENOTYPING-BY-SEQUENCING (GBS) 

The traditional DNA sequencing technologies could not 
meet the demand for indebt sequence information required 
in complex genomic research. With the advent of next gen¬ 
eration sequencing technology (NGS), the knowledge gap is 
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filled and the technology has become an inevitable everyday 
research tool in complicated genome studies [43]. Before the 
NGS came into being, Sanger or dideoxy sequencing was 
the most widely used DNA sequencing technique. But the 
Sanger sequencing is relatively expensive, laborious and 
time consuming apart from its inherent limitation of requir¬ 
ing in vivo amplification of DNA fragments to be sequenced. 
The limitations are overcome with the introduction of 454 
technology in the market which was the first NGS technol¬ 
ogy which relied on the method of in vitro DNA amplifica¬ 
tion known as emulsion PCR [44]. 454 platfonn marketed 
by Roche Applied Science has the capability of generating 
80-120 Mb of sequence in 200-300 bp reads in 4 hour runs 
and is one of the most popularly used sequencing technolo¬ 
gies in genome research. Recent advancements in NSG 
technologies make it possible the use of SNPs for genetic 
analysis to a new level. Genotyping-by-sequencing (GBS) 
first introduced by Elshire et al. [45] is a novel application of 
NSG protocols for discovering and genotyping SNPs in crop 
genome and populations. This is highly multiplexed system 
for constructing reduced representation libraries for the II- 
lumina NGS platform generating good numbers of SNPs for 
employing in genetic analysis and genotyping [46]. GBS is 
a cost effective and efficient technique for genomics assisted 
breeding in varied crops like Cotton, Brassica, Sorghum and 
Miscanthus [43]. Using ion PGM systems, two GBS strate¬ 
gies have been developed [47]. The first approach is ideal for 
discovering new markers for MAS programs and no specific 
SNPs are identified with digestion by restriction enzymes. 
Genome complexity is reduced with DNA restriction diges¬ 
tion using one or two selected restriction enzymes before 
adapter ligation [48]. In the second approach, a set of SNPs 
is defined for particular genome region using PCR primer 
designed to amplify only the selected interest region of ge¬ 
nome [47]. The use of sequencing restriction site associated 
genomic DNA (RAD) for discovery of high density SNP and 
genotyping is more complex and expensive as compared to 
GBS. 


CONCLUSION 

The varied molecular markers employed in plant research 
have been discussed along with their applications in genetic 
variability evaluation, genome fmgerprintingand population 
genetic studies. Hybridization based DNA marker which is 
one of the pioneer markers for plant genetic diversity studies 
have been superseded by the development of more reliable 
and reproducible PCR based markers. However, either sim¬ 
ple or more advanced molecular markers possess inherent 
strengths and weakness and none of them are perfect with 
shortcomings. Selection of the most appropriate marker 
will ultimately depend on the particular research approach 
adopted and degree of resolution and polymorphism required 


for the specific study. With rapid progress in molecular bi¬ 
ology, more effective and superior markers may appear in 
near future which can significantly accelerate plant breeding 
research. The recent advancement of NGS technology which 
led to the development of GBS will also provide ultimate 
MAS tool for rapid enhancement in plant improvement and 
crop breeding. 
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